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ABSTRACT 

We present measurements of the masses of a sample of 25 moderate X-ray luminosity clusters of galaxies 
from the 160 square degree ROSAT survey. The masses were obtained from a weak lensing analysis of deep 
F814W images obtained using the Advanced Camera for Surveys (ACS). We present an accurate empirical 
correction for the effect of charge transfer (in)efficiency on the shapes of faint galaxies. A significant lensing 
signal is detected around most of the clusters. The lensing mass correlates tightly with the cluster richness. 
We measured the intrinsic scatter in the scaling relation between M2500 and Lx to be cr\ ogLx \ M = 0.23^qq4. The 
best fit power law slope and normalisation are found to be a = 0.68 ±0.07 andMx = (1 .2 ±0.12) x /i7, I ) lO I4 M0 
(for Lx = 2 x 10 ffm er g/s)- Thes e results agree well with a number of recent studies, but the normalisation 
is lower compared to the study of Rvkoff et al. (2008a). One explanation for this difference may be the fact 
that (sub)structures projected along the line-of-sight boost both the galaxy counts and the lensing mass. Such 
superpositions lead to an increased mass at a given Lx when clusters are binned by richness. 
Subject headings: cosmology: observations - dark matter - gravitational lensing 



1. INTRODUCTION 

Clusters of galaxies are known to exhibit correlations be- 
tween their various observable properties, such as the well- 
known relation between X-ray luminosity and X-ray temper- 
ature. These scaling relations are the result of the various pro- 
cesses that govern the formation and subsequent evolution of 
galaxy clusters. Hence, the study of cluster scaling laws pro- 
vides the basis of testing models for the formation of clusters 
of galaxies and of galaxies themselves. For instance, N-body 
codes can nowadays reliably and robustly predi ct the evolu- 
tion o f the mass function of cluster halos (e.g., lEvrard et al.l 
2002). Hydrodynamic simulations and semi-analytic tech- 
niques then allow us to pred ict the X-ray and optical ap- 
pearance of these halos (e.g., I Voitl 120051 iNagai et all 12007b 
iBoweret al.ll2008h . 

The mean relation and the intrinsic dispersion of the re- 
sulting mass-observable relations provide a test of the ade- 
quacy of the physical processes included in the simulations: a 
realistic simulation should reproduce the mean relation, the 
intrinsic scatter, and its evolution (if any). Such compar- 
isons have recently led to the realization that cluster simu- 
lations must include radiative cooling and feedback from su- 
pernovae and AGNs in ord er to s ucces sfully explain the ob- 
served scaling relations (see I Voitl d2005) for a review). While 
the core regions of clusters remain difficult to model accu- 
rately, high-resolution simulations with cooling and feedback 
now produce clusters whose X-ray properties agree well with 
those of observed clusters outside the central 100 kpc or so 
(Nag ai et al.ll2007l) . 
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An empirically determined mass scaling relation is not only 
useful to directly test our understanding of cluster physics, 
it also allows one to relate the observables directly to the 
mass of cluster-sized halos. The latter, for instance, en- 
ables one to transform the observed luminosity function 
into a mass function, which in turn can be compared to 
different cosmological models. Such studies have already 
provided interesting constraints on cosmological para meters 
which complement constraints from other probes (e.g., Henry 



2000tlBorgani et al.l200UlReiprich & Bo hringer 2002; Henry 
2004lGladders et ai.ll2007t iHenrv et al.ll2009MVikhlinin et al 
2009t>t lMantz et aljioiObi 

To determine cluster masses, a number of methods are 
available. Dynamical techniques, such as the assumption 
of hydrostatic equilibrium in X-ray observations, have been 
widely used. The inferred masses, however, are likely to 
be systematically underestimated by up to 20% because of 
pressure support from turbule nce and energy input from ac- 
tive galactic nucle i (e.g.. lEvrard et al.l fl996: Rasia et al.H2006t 
Nagai et al. 2007). Masses derived from weak gravitational 
lensing do not require assumptions about the dynamical state 
of the cluster and can in principle be compared directly to the 
outcome of numerical simulations. It is worth noting, how- 
ever, that lensing is sensitive to all structure along the line-of- 
sight. Weak lensing is now a well-e stablished technique (for 
a review see Hoeks tra & Jainl 12008) and recent work has in- 
deed provided support for non-hydrosta tic gas in the outskirt s 
of X-ray luminous clusters of galaxies (Mah davi et alj 2008). 

To date, most work has focussed o n the most massive 
clusters at intermediate redshifts (e.g., iBardeau et al.l 120071 
iHoekstral 120071 iZhang et al.ll2008l) . because of the relatively 
large lensing signal. In this paper we focus on extending the 
mass range towards clusters with lower X-ray luminosities 
(L x < few 10 44 erg/s). To detect the lensing signal of these 
lower mass clusters, the number density of lensed background 
galaxies needs to be increased significantly, compared to deep 
ground based observations. The reason for this requirement is 
that the error in the weak lensing mass estimate is set by the 
intrinsic shapes of the galaxies and their number density. 

To this end, we have conducted a weak-lensing analysis 
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on images obtained during a snapshot survey of 25 moder- 
ate luminosity X-ray clusters with the Advanced Camera for 
Surveys (ACS) on board the Hubble Space Telescope (HST). 
These clusters were randomly selected fr om the ROSAT 160 
Square Degree (160S D) Cluster Catalog dMullis et all 120031: 
Vikhli nin et alJ 119981) . The clusters in the 160SD sample 
were serendipitously discovered in pointed, relatively wide- 
field (30' diameter) observations of the Position-Sensitive 
Proportional Counter (PSPC) on board the ROSAT X-ray ob- 
servatory (1990-1999). Seventy-two X-ray clusters between 
Z = 0.3 - 0.7 were discovered (see Fig. [p. All h ave been iden- 
tified and assigned redshifts (Mullis et al. 2003). Because this 
survey included pointed observations that were quite long, 
some of these clusters are among the faintest clusters of galax- 
ies known at these moderately high redshifts. Therefore, our 
new weak lensing measurements extend the X-ray luminosity 
limit of the mass-Z^ relation by almost an order of magni- 
tude, based on targeted observations. We note that studies of 
the ens emble-averaged pro perties of clusters discovered in the 
SPSS dRvkoffetal]|2008al) and X-ray groups in COSMOS 
(Leauthaud et al. 2010) have pushed the limits to even lower 
luminosities. 

The structure of the paper follows. In §2 we describe our 
data and weak lensing analysis. In particular we discuss how 
we correct for the effects of CTE in our ACS data and how we 
correct for PSF anisotropy. The measurements of the cluster 
masses, and the comparison to the X-ray properties are pre- 
sented in §3. We compare our results to previous work and 
examine biases in our mass estimates that arise from uncer- 
tainties in the position of the cluster center and sample selec- 
tion. Throughout this paper we assume a flat ACDM cosmol- 
ogy with fi m = 0.3 and Hq = lOhjo km/s/Mpc. 



2. DATA AND ANALYSIS 

The data studied here were obtained as part of a snap- 
shot program (PI: Donahue) to study a sample of clusters 
found in the 160 square degree survey (Vikhlinin et al Jl 1 998k 
iMullis et al. 2003). The clusters from the latter survey were 
selected based on the serendipitous detection of extended X- 
ray emission in ROSAT PSPC observations, resulting in a to- 
tal survey area of 16 deg 2 . A detai l ed dis cussion of the sur- 
vey can be fou nd in | Vikhlinin et al.l (119981) . The sample was 
reanalysed by Mull is et al.l (120031) . which also lists spectro- 
scopic redshifts for most of the clusters. 

The X-ray luminosity as a function of cluster redshift is 
plotted in FigureQ] The HST snapshot program targeted clus- 
ters with 0.3 < z < 0.6, which were observed during HST cy- 
cles 13 and 14. This resulted in a final sample of 25 clusters 
that were imaged in the F814W filter with integration times 
of ~ 2200s. Note that these were drawn randomly from the 
sample, and therefore represent a fair sample of an X-ray flux 
limited survey. The clusters with HST data are indicated in 
Figure[T]by the large open points. 

Table Q] provides a list of the clusters that were observed. 
It also lists the restframe X-ray luminosities in the 0.1-2.4 
keV band. The values were taken from the BAX X-Rays 
Galaxy Clusters Databas^] and converted to the cosmology 
we adopted here. As explained on the BAX website, the lu- 
minosities were derived from the flux measurements using a 
Raymond-Smith type spectrum assuming a metallicity of 0.33 
times solar. 

9 http://bax.ast.obs-mip.fr/ 
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Figure 1. X -ray luminosity as a function of redshift for the 160 square degree 
survey from Mullis et al. 1 2003). The large open points indicate the clusters 
for which HST imaging data were obtained. 

Each set of observations consists of four exposures, which 
allow for efficient removal of cosmic rays. We use the 
multi drizzle task to remove the geometric distortion 
of ACS images and t o combine the multiple exposures 
dKoekemoer et alj|2002l) . This task also removes cosmic rays 
and bad pixels, etc. Because of the large geometric distor- 
tion and offsets between the individual exposures, the images 
need to be resampled bef ore co-addition. A n umber of op- 
tions have been studied by R hodes et al.l d2007l) who suggest 
to use a Gaussian kernel and an output pixel scale of 0.03". 
However, this procedure leads to correlated noise in the final 
image. Instead we opt for the lanczos3 resampling kernel 
and keep the original pixel size of 0.05". These choices pre- 
serve the noise properties much better, while the images of 
the galaxies are adequately sampled. These co-added images 
are subsequently used in the weak lensing analysis. 

2.1. Shape analysis 

The first step in the weak lensing analysis is the detec- 
tion of stars and galax ies, for which we use SExtractor 
dBertin & Arnoutslll99 6'). The next important step is the un- 
biased measurement of the shapes of the faint background 
galaxies that will be used to quantify the lensing signal. To 
do so, we measure weighted quadrupole moments, defined as 



(1) 



Iij= / d xxiXjW{x)f(x), 



where W(x) is a Gaussian with a disp ersion r g , which is 
matched to the size of the object (see iKaiser et alJ 1 19951 : 
iHoekstraet all 1 9981 for more details). These quadrupole mo- 
ments are combined to form the two-component polarization 
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However, we cannot simply use the observed polarizations, 
because they have been modified by a number of instrumen- 
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Table 1 

Summary of the observational data for the cluster sample 



name 


number 


z 


L>X 


RAx-ray 


L)h,Cx-ray 


RAbcg 




CiBCG 


\P) 


\P ) 


(1) 


(2) 


(3) 


(4) 


(5) 


(6) 


(7) 


(8) 


(9) 


(10) 


(11) 


RXJ 0056. 9 — 2740 


6 


0.563 


1.14 


r\r\h c tun c f 1 v 

00 56 56.1 


T"lO Af\l 1 ^// 

—27 40 12 


aa/j c r "tn c/r AOS' 

00 56 56.98 


ono 1 a/ta a// 

—27 40 29.9 


2 


0.42 


0.23 


t» v t a 1 1 a i i mo o 

KXJ01 10.3 + 1938 


8 


0.317 


0.40 


a 1 h 1 run to ns 

or io i8.o 


1 1 AO oo/n// 

+19 38 23 


Al ftl A/// 1 O 11 S 

01 10 18.22 


1 1 ao io/in /i // 

+19 38 19.4 





0.61 


0.43 


T»"V"TA1 C /I 1 Cm "7 

KXJ0154.2 — 593 7 


20 


0.360 


0.90 


f\'\ h C AI11 ■t A OS 

01 54 14.8 


rno T7/ a ft 

59 37 48 


A 1 h C A III A 1 "70.S 1 

Or 54 13.72 


caOoi/ti a// 

—59 3 7 31.0 


1 


0.57 


0.38 


T» "V TA COO O T/'TC 

KX J 05 22 . 2 — 3 625 


41 


0.472 


2.07 


05 22 14.2 


—36 25 04 


05 22 15.48 


—36 24 56.1 


1 


0.48 


0.29 


RXJ0826. 1 + 2625 


52 


0.351 


0.65 


rtoll o /T 111 df AS 

08 26 06.4 


1 1 v" O ^ c / A^il 1 

+26 25 47 


r\ah ill i\r\ azS 

08 26 09.45 


iT/'O^c/ao 1// 

+26 25 03.1 





0.58 


0.39 


KXJ0841.1 +6422 


56 


0.342 


1.60 


noli A t 111 A^ A t 

08 41 07. 4 


+64 22 43 


F\oll A 1 /if A"7 £CV 

08 41 7.65 


1 /:/ioio/o£ A// 

+64 22 26.0 


2 


0.59 


0.40 


RXJ0847. 1 + 3449 


59 


0.560 


1.17 


r\oh ahii 1 "i is 

08 47 11.3 


+34 49 16 


r\ah A'nm 1 1 t a v 

08 47 1 1.79 


+34 48 51. 8 


1 


0.42 


0.23 


T» V TAA 1 n £ i ^T/IO 

KXJ 09 1 0. 6 + 4248 


69 


0.576 


1.44 


nr\h 1 mil i a is 

09 10 39.7 


1 /I lO ,1 0/ A A II 

+42 48 41 


An/i 1 A7ii yl A CIS 

09 10 40.53 


1 /no in/cn 1// 

+42 49 59.1 


-1 


0.41 


0.23 


KXJ 092 1.2 + 4528 


70 


0.315 


1.40 


nn/io i m \i as 

09 21 13.4 


+45 28 50 


09 21 13.46 


1 A CO 00/ c/: 1 // 

+45 28 56.1 


1 


0.61 


0.43 


T\ V TAAO jC i 

KXJ0926.6 + 1242 


71 


0.489 


2.04 


09 26 36.6 


+12 42 56 


09 26 36.70 


1 nOn/m oil 

+12 43 03.8 


2 


0.47 


0.28 


KXJ095 7.8 + 6534 


80 


0.530 


1.39 


09 57 53.2 


+65 34 30 


Aa// Cl/fl C 1 T1,S 

09 5 7 51.22 


. /CCOl/l/oc 1 // 

+65 34 25.1 


2 


0.44 


0.25 


T» V T1 Al C 1 i A m 1 

KXJ 1015.1 +4931 


88 


0.383 


0.78 


i r\h 1 cm/AO c< 

10 15 08.5 


1 A AO Tl/n'/ 

+49 31 32 


1 a/; 1 f/iiAO /i /i s 1 

10 15 08.44 


1 -lAOOl/CA O// 

+49 31 50.8 


2 


0.55 


0.36 


"Tl \/ T1 1 1 "7 O i X'H A A 

RXJ 1 1 17.2+1744 


96 


0.305 


0.54 


11/71 T/o 1 ns 

1 1 17 12.0 


1 ITO/l^/^/l// 

+17 44 24 


11 17 11.23 


1 1 1 O /I A I A A C / / 

+17 44 00.5 


2 


0.62 


0.44 


KXJ 1 1 17.4 + 0743 


97 


0.477 


0.76 


1 1 1 7 26. 1 


1 ATO A "1 I 1 c// 

+0/ 43 35 


I iftl Ull^C f\AS 

II 17 26.04 


1 m n/ io i / / 

+07 43 38.3 


1 


0.48 


0.29 


T» V Tl 111 1 i 1/1 AA 

KXJ 1 123.1 + 1409 


101 


0.340 


1.01 


I Wm/Ji in is 

I I 23 10.2 


1 1 /i An/ a a II 

+14 09 44 


I 1 /iniii 1 A ACS 

I I 23 10.95 


1 1 iOao/t^ a II 

+14 08 36.4 


1 


0.59 


0.40 


DY1 1 T,*\A 9 099 1 
KAJ 1 Jjf.Z — UZZ 1 


1 S 1 


U. J+O 


1 S"* 

i.jj 




09° 9 1 1 Al" 
— UZ Z 1 H- / 




— UZ Zl J7.Z 




z 




n ia 
u.z^ 


RXJ1524.6 + 0957 


170 


0.516 


3.94 


15 A 24 ra 40.3 I 


+09° 57' 39" 


15''24 m 41.56 s 


+09°57'34.3" 


1 


0.45 


0.26 


RXJ1540.8 + 1445 


172 


0.441 


0.81 


15*40*53.3' 


+14°45'34" 


15''40 m 53.96 s 


+14°45'56.0" 


1 


0.51 


0.31 


RXJ 1642.6 + 3935 


186 


0.355 


0.62 


16*42 m 38.9 s 


+39°35'53" 


16 A 42 m 38.35 I 


+39° 36' 10.4" 


1 


0.58 


0.39 


RXJ2059.9-4245 


199 


0.323 


2.44 


20'59" , 55.2 I 


-42°45'33" 


20 A 59 m 54.92 s 


-42°45'32.1" 


2 


0.61 


0.42 


RXJ2108.8-0516 


200 


0.319 


0.81 


21*08 m 51.2 s 


-05°16'49" 


21 A 08'"51.17 S 


-05°16'58.4" 


2 


0.61 


0.42 


RXJ2139.9-4305 


203 


0.376 


0.59 


21 A 39 m 58.5' 


-43°05'14" 


2l''39 m 58.22 I 


-43°05'13.9" 


2 


0.56 


0.37 


RXJ2146.0 + 0423 


204 


0.531 


2.61 


2l' , 46 m 04.8 I 


+04°23'19" 


2l''46'"05.52 I 


+04°23'14.3" 


2 


0.44 


0.25 


RXJ2202.7-1902 


205 


0.438 


0.70 


22 /l 02'"44.9 s 


-19°02'10" 


22*02 m 45.50 I 


-19°02'21.1" 


2 


0.51 


0.31 


RXJ2328.8 + 1453 


219 


0.497 


1.02 


23*28" , 49.9 I 


+14°53'12" 


23 A 28 m 52.27 s 


+14°52'42.8" 


1 


0.46 


0.27 



Note. — (2) entry number for the cluster in Mullis et al. 12003); (3) cluster redshift from Mullis et al. 12003); (4) restframe X-ray luminosity in the 0.1—2.4 
keV band in units of 10 44 /i^ ergs/s. (5) & (6) location (J2000.0) of the X-ray position from lMullis et~aTI i2003l) . (7) & (8) location (J2000.0) of the assumed 
brightest cluster galaxy; (9) 'quality' of BCG identification (see §2.5 for details); (10) & (11) mean /3 = D\ s jD s and variance as explained in the text. 



tal effects. Although the PSF of HST is small compared to 
ground based data, the shapes of the galaxies are nonethe- 
less slightly circularized, lowering the amplitude of the ob- 
served lensing signal. Furthermore, the PSF is not round and 
the resulting anisotropy in the galaxy shapes needs to be un- 
do ne. To do so, we use the well established method proposed 
by iKaiser et alj d|99 5), with the modificati ons pr ovided in 
Luppi no & Kaiserl (1 19971) and lHoekstra etal] (TT998ll2000b . 

2.2. Correction for CTE degradation 

An issue that is relevant for ACS observations is the degra- 
dation of the charge transfer efficiency (CTE) with time. Over 
time, cosmic rays cause an increasing number of defects in 
the detector. During read-out, these defects can trap charges 
for a while. The delayed transfer of charge leads to a trail of 
electrons in the read-out direction, causing objects to appear 
elongated. The effect is strongest for f aint objects, because 
brighter objects quickly fill the traps. iRhodes et alj (12007b 
provide a detailed discussion of CT E effects and their im pact 
on weak lensing studies. Similar to Rhodes et alj ( 120071) we 
derive an empirical correction for CTE, but our adopted ap- 
proach differs in a number of ways. 

The presence of CTE in our data leads to a slight modifica- 
tion of our lensing analysis. First, we note that CTE affects 
only e\ and that the change in shape occurs during the read- 
out stage. Hence the measured value for e\ needs to be cor- 
rected for CTE before the correction for PSF anisotropy and 
circularization: 



where e\ is the predicted change in polarization given by 
the model derived in the Appendix. To derive our CTE model, 
we used observations of the star cluster NG C 104 as well 
as 10 exposures from the COSMOS survey dScoville et alj 
120071) . The observations of a star cluster allow us to study 
the effect of CTE as a function of position, time and signal- 
to-noise ratio with high precision, because stars are intrinsi- 
cally round after correction for PSF anisotropy. We find that 
the CTE effect increases linearly with time and distance from 
the read-out electronics. The amplitude of the effect is ob- 
served to be proportional to ^S/N. We use 100 exposures 
from COSMOS to examine how the CTE effect depends on 
source size. We find a strong size dependence, with e^ TE oc r~ 2 
(where r g is the dispersion of the Gaussian weight function 
used to measure the quadrupole moments). 

2.3. Correction for the PSF 

Once the CTE effect has been subtracted for all objects (in- 
cluding stars), the le nsing analysis proceeds as described in 
iHoekstra et alJ (Il998h . Hence the next step is to correct both 
polarization components for PSF anisotropy. The ACS PSF 
is time dependent and therefore a different PSF model is re- 
quired for each observation. An added complication is the 
fact that only a limited number of stars are observed in each 
image. 

To estimate the spatial variation of the PS F anisotropy 
a number of procedures have been proposed (Rhod es et alj 
l2007 HSchrabback et al.l20 07l|2010|) Here we opt fo r a simple 
approach, similar to the one used in Hoekst ra et alj ([1998). 

We use observations of the star cluster NGC 104 to derive 
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Figure 2. PSF reference model based on observations of NGC104. The 
sticks show the direction and amplitude (indicated by the length) of the po- 
larization of the PSF. The PSF anisotropy is generally small, but can reach 
4% at the edges of the field. 



Figure 3. The ensemble averaged PSF polarization e\ and ei as a function 
of position using stars detected in our galaxy cluster exposures. The open 
circles show the PSF anisotropy before correction, whereas the solid points 
correspond to the residuals after applying the best fit PSF model to the stars. 



an adequate model for PSF anisotropy. These data were taken 
at the start of ACS operations (PID 9018), and therefore do 
not suffer from CTE effects. We model the PSF anisotropy 
by a third order polynomial in both x and y. Such a model 
would not be well constrained by our galaxy cluster data, but 
it is thanks to the high number density of stars in NGC104. 
We select one of the models as our reference, because the 
PSF pattern varied only mildly from exposure to exposure. 
The resulting reference model is shown in Figure[2] The PSF 
anisotropy is fairly small, but can reach ~ 4% towards the 
edges of the field. 

Most of the variation in the ACS PSF arises from focus 
changes, which are the result of changes in the telescope tem- 
perature as it orbits the Earth. We therefore expect that a 
scaled version of our model can capture much of the spatial 
variation: as the detector moves from one side of the focus 
to the other, the direction of PSF anisotropy changes by 90 
degrees, which corresponds to a change of sign in the polar- 
ization. To account for additional low order changes, we also 
include a first order polynomial: 



,P SF = ae NGC104 + flo + fliX + fl23; 



(4) 



This simple model, with only 4 parameters, is used to char- 
acterize the PSF anisotropy for each individual galaxy cluster 
exposure. The number of stars in the galaxy cluster images 
ranges from 7 to 84, with a median of 20. To examine how 
well our approach works, we averaged the shapes of all the 
stars in our 25 galaxy cluster exposures. The ensemble av- 
eraged PSF polarization as a function of position is shown 
in Figure [3] The open points show the average PSF shapes 
before correction. More importantly, the filled points show 
the residuals in polarization after the best fit model for each 
galaxy cluster pointing has been subtracted. These results 
suggest our model is adequate for our weak lensing analysis. 



2.4. Lensing signal 

The corrected galaxy shapes are used to measure the weak 
lensing signal. A convenient way to quantify the lensing sig- 
nal is by computing the azimuthally averaged tangential shear 
77- as a function of radius from the cluster center (see §2.5 
for our choice of the center). It can be related to the surface 
density through 



£«r)-E(r) 



K(< f) — K(f), 



(5) 



where £(< r) is the mean surface density within an aperture 
of radius r, and E(r) is the mean surface density on a circle 
of radius r. The convergence n, or dimensionless surface den- 
sity, is the ratio of the surface density and the critical surface 
density £ cr it, which is given by 



D, 



4vrG D,D, S 



(6) 



where D/ is the angular diameter distance to the lens. D s and 
Dis are the angular diameter distances from the observer to 
the source and from the lens to the source, respectively. The 
parameter (3 = max[0, D/ S /D s ] is a measure of how the am- 
plitude of the lensing signal depends on the redshifts of the 
source galaxies (where a value of is assigned to sources with 
a redshift lower than that of the cluster). 

One of the advantages of weak lensing is that the (pro- 
jected) mass can be determined in a model-independent way. 
To derive accurate masses requires wide field imaging data, 
which we lack because of the small field of view of ACS. In- 
stead we fit parametric models to the lensing signal. 

The singular isothermal sphere (SIS) is a simple model to 
describe the cluster mass distribution. In this case the conver- 
gence and tangential shear are equal: 
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K (r) = 7r (r)=-i, (7) 
2r 

where r E is the Einstein radius. In practice we do not observe 
the tangential shear directly, but we observe the reduced shear 
gr instead: 

gr=r 2 -- (8) 

1 - K 

We correct our model predictions for this effect. Under the 
assumption of isotropic orbits and spherical symmetry, the 
Einstein radius (in radian) can be expressed in terms of the 
line-of-sight velocity dispersion: 

r E = 4n{^-y p. (9) 

We use this model when listing lensing inferred velocity dis- 
persions in Table |2] 

A more physically m otivated model is the one proposed 
by lNavarro et al.l (119971) who provide a fitting function to the 
density profiles observed in numerical simulation s of cold 
dark m atter. We follow the conventions outlined in iHoekstral 
(2007), but use an update d relation between the virial mass 
M v i r and concentration c. iDuffv et all ([2008) studied numer- 
ical simulat ions using the best fi t parameters of the WMAP5 
cosmology (Komatsu et al. 2009). The best fit c(M v ; r ) is given 
by: 

/ M . \ -0-081 

We use this relation when fitting the NFW model to the lens- 
ing signal. Rather than the virial mass, in Table|2]we list M2500 
which is the mass within a radius ^500 where the mean mass 
density of the halo is 2500 times the critical density at the 
redshift of the cluster. We note that a different choice for the 
mass-concentration relation will change the inferred masses 
much. We found that the inferred values for M2500 change 
by 5-10% if we change the pre-factor in EquationfTOlbv ±1, 
which is much smaller than our statistical uncertainties. When 
comparing to ensemble averaged results from other studies, 
however, differences in the adopted mass-concentration rela- 
tion may become a dominant source of uncertainty. 

Finally, we note that the lensing signal i s sensitive to all 
matte r along the line-of-sight. As shown in IHoekstral (120011 
2003), the large-structure in the universe introduces cosmic 
noise, which increases the formal error in the mass estimate, 
compared to just the statistical uncertainty. The listed uncer- 
tainties in the weak lensing masses include this contribution. 

2.5. Cluster center 

We have to choose a position around which we measure the 
weak lensing signal as a function of radius. An offset be- 
tween the adopted position and the 'true' center of the dark 
matter halo will lead to an underestimate of the cluster mass. 
Possible substructure in the cluster core complicates a simple 
definition of the clu ster center, but it can also lead to biased 
mass estimates (e.g., Ho ekstra et al.ll2002l) . We expect our re- 
sults to be less affected by substructure because the ACS data 
used here ex tend to larger radii tha n the WFPC2 observations 
discussed in Hoekst ra et al.l (120021) . 

The resulting bias depends on the detailed proced ure that is 
used to to interpret the lensing signal. For instance. IHoekstral 
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Figure 4. Plot of the ratio of the inferred lensing mass and the true mass 
as a function of centroid offset. The lensing mass is obtained by fitting an 
NFW model to the shear at 200-750/?^ kpc (solid black curve) and 0.5- 
1.5/iyQMpc (dashed red curve). The blue dotted line corresponds to the bias 
if all data within 750/1^ kpc are used. The results presented here are for 
a cluster with a mass M2500 = 2 X 10 14 /!^Mq, but we note that the bias 
varies only by a few percent over the range of masses we consider here. The 
histogram indicates the frequency of offsets found for massive clusters from 
Bildf elletalJ<200l) . 

(120071) used wide field imaging data to measure the lens- 
ing signal out to large radii and to derive (almost) model- 
independent masses. This procedure minimizes the effect of 
centroiding errors because the large-scale signal is affected 
much less, compared to the signal on small scales. Unfortu- 
nately, the ACS field of view is relatively small, which pre- 
vents us from following the same approach and we fit a pa- 
rameterized mass model to the lensing signal instead. To re- 
duce the sensitivity of our results to centroiding errors and 
central substructure, we fit the NFW and SIS models to the 
tangential distortion at radii 200 < r < 750/i7 ( )kpc. 

The advantage of restricting the analysis to larger scales is 
also evident from Figure |H where we show the bias in mass 
as a function of centroid offset r D ff. The solid curve shows 
the results when fitting an NFW model to the signal within 
200- 750/i 70 kpc, which is the range we use for the ACS data 
st udied here. 

iMullis et alj ((2003) provide cluster positions based on the 
X-ray emission. For the clusters in our sample, the listed cen- 
troiding uncertainty depends on the X-ray luminosity (with 
smaller values for the more luminous systems). An alternative 
approach is to use the location of the brightest cluster galaxy 
(BCG). The resulting positions are listed in TableQ] In many 
cases a clear candidate can be identified (indicated by a value 
of 2bcg of 1 or 2 in Table Q]), but in a number of cases the 
choice is ambiguous (indicated by Qbcg -1 or 0). 

High quality X-ray data can be used to refine the centers, 
but such data are lacking for our sample. Even if such data 
were available, there can be a offs et between the BC G and 
the peak in the X-ray emission (e. gjBildfell et al.1l2008l) . The 
distribution of offsets observed by Bild fell et alj (|2008) for the 
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massive clusters in the Canadian Cluster Comparison Project, 
are indicated by the histogram in Figure [4] It is clear that the 
offsets are typically less than 50 kpc, leading to biases less 
than 5%. The larger offsets are found for merging massive 
systems where the identification of the BCG is difficult. Such 
major mergers do not appear to be present in the sample of 
clusters studied here. 

We compared the offset between the X-ray position and 
th e adopted lo c ation of the BCG to the uncertainty listed 
in lMullis"et al. (2003) and find fair agreement: most of the 
BCGs are located within the radius corresponding to the 90% 
confidence X-ray position error circle. Only for 4 low lumi- 
nosity systems do we find an offset that is much larger. The 
positions of the BCGs can be determined more accurately, and 
we therefore adopt these as the cluster centers when listing our 
mass estimates. 

2.6. Source galaxies 

The lensing signal is largest for background galaxies at red- 
shifts much larger than the cluster. We lack redshift informa- 
tion for our sources and instead we select a sample of faint 
(distant) galaxies with 24 < F814W < 26.5, which also re- 
duces contamination by cluster members which dominate the 
counts at bright magnitudes. Nonetheless, contamination by 
cluster members cannot be ignored because we lack color in- 
formation. We note that adding a single color would not im- 
prove the situation significantly because the faint members 
span a wide range in color, unlike the bright members, almost 
all of which occupy a narrow red-sequence. 

To account for contamination by cluster members we mea- 
sure the number density of faint galaxies as a function of dis- 
tance from the adopted cluster center and boost the signal by 
the inferred fraction of cluster members. The corrected tan- 
gential distortion for RXJ0847. 1+3449 as a function of dis- 
tance from the adopted cluster centre is shown in Figure [5^. 
The red line shows the best fit singular isothermal sphere 
model (fitted to radii > 25")- If the observed signal is caused 
by gravitational lensing, the signal should vanish if the source 
galaxies are rotated by 45 degrees. Figure^ shows the results 
of this test, which are indeed consistent with no signal. 

The lower panel in Figure [5] shows the excess counts A«b g 
as a function of radius for the cluster RXJ0847. 1+3449 (z = 
0.56). To determine the background count levels we used the 
100 COSMOS pointings that were used to measure the size- 
dependence of CTE. We measure the excess counts for each 
cluster individually, because the sample spans a fair range in 
redshift and mass. 

This cluster shows a signicant excess of faint galaxies over 
the background number density of ~ 61 arcmin" 2 . We as- 
sume that these faint cluster members are oriented randomly 
and thus simply dillute the lensing signal. To quantify the 
overdensity of faint members we adopt A«b g oc r~ l and de- 
termine the best fit normalization for each cluster separately 
(indicated by the drawn line in Figure|5};. This simple model 
is used to correct the observed tangential distortion for con- 
tamination by cluster members. We find that this correction 
leads to an average increase in the best fit Einstein radius of 
~ 20%. The uncertainty in the amplitude of the contamina- 
tion is included in our quoted measurement errors (which are 
increased by less than 5%). 

The conversion of the lensing signal into an estimate for 
the mass of the cluster requires knowledge of the redshift 
distribution of the source galaxies. These are too faint to 
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Figure 5. panel a: Tangential distortion as a function of radius for 
RXJ0847. 1+3449. The measurements have been corrected for contamina- 
tion by faint cluster member using the best fit model to the excess counts 
shown in panel c. The best fit SIS model based on a fit to the measurements 
at r > 25 arcseconds is indicated by the drawn line. The arrow indicates a 
radius of 0.5/i~'Mpc. panel b: The signal when the sources are rotated by 
45 degrees. In this case the signal should vanish if the measurements shown 
in panel a are due to gravitational lensing. panel c: Excess counts of galax- 
ies with 24 < F814W < 26.5 as compared to the mean counts based on 100 
frames from the COSMOS survey. The expected background counts are ^61 
arcmin -2 . The drawn line is the best fit 1 jr model, which is used to correct 
the measurements of the tangential distortion. 



be included in spectroscopic redshift surveys or even from 
ground based photome tric redshift surveys (e.g., IWolf et aTl 
l2004t Hlbert et al.ll2006l) . Instead we use the photometric red- 
shift distributions determine d from the Hubble Deep Fields 
dFernandez-Soto et al.lll999h . For the range in apparant mag- 
nitudes used in the lensing analysis, the resulting average 
value for (jS) a nd the variance (j3 2 ) are listed in Table Q] 
As discussed in Hoekst ra et alj (|2000) the latter quantity is 
needed to account for the fact that we measure the reduced 
shear. We estimate the uncertainty in f3 by considering the 
variation between the two HDFs. As the average source red- 
shift is much higher than the cluster redshifts, the resulting 
relative uncertainties are small: ^2%atz = 0.3, increasing to 
~ 5% at z = 0.6, much smaller than our statistical errors. 

3. RESULTS 

As discussed above, we consider two often used parametric 
models for the cluster mass distribution. We fit these to the 
observed lensing signal at 200-750/;7 ( )kpc from the cluster 
center. The resulting Einstein radii and velocity dispersions 
are listed in Table [2] We also list the best-fit value for M2500 
inferred from the NFW model fit to the signal, where we use 
the mass-concentration relation given by Eqn. [10] For refer- 
ence we also list the corresponding value for r25oo from the 
NFW fit. 

We use this radius to compute A^soo, the excess num- 
ber of galaxies with a rest-frame B-band luminosity -22 < 
Mb < -18.5 within r25oo (we assume passive evolution when 
computing the rest-frame B-band luminosity). Recent stud- 
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Table 2 

Measurements of cluster properties 



name 


rE 


17 


r 2500 


M2500 


-^2500 




n 


[km/s] 


[ft^Mpc] 


[^1O 13 M ] 





RXJ0056 
RXJ0110 
RXJ0154 
RXJ0522 
RXI0826 
RXJ0841 
RXJ0847. 
RXJ0910. 
RXJ0921 
RXJ0926. 
RXJ0957. 
RXJ1015 
RXJ1117 
RXJ1117 
RXJ1123 
RXJ1354. 
RXJ1524 
RXI1540 
RXJ1642 
RXJ2059 
RXJ2108. 
RXJ2139 
RXJ2146. 
RXJ2202 
RXJ2328 



.9 - 2740 

3 + 1938 
.2-5937 
.2-3625 
.1+2625 
.1+6422 

1 + 3449 
6+4248 
.2+4528 
6+1242 
8 + 6534 
.1+4931 
2+1744 

4 + 0743 
.1 + 1409 
.2-0221 
6 + 0957 
8 + 1445 
6 + 3935 
9-4245 

8- 0516 

9- 4305 
.0+0423 
.7-1902 
8+1453 



5.4±1.5 
5.8± 1.3 
3.6±1.4 
6.8± 1.3 
2.4±1.8 
7.5 ±1.3 
10.3 ± 1.7 
3.5 ±1.7 

2.1 ±1.5 
8.9± 1.5 
4.5 ±1.4 
6.0±1.4 
1.9±1.6 
5.5 ±1.4 
5.8±1.4 
9.6±1.4 
4.3 ±1.3 
4.0±1.4 
2.2± 1.5 
4.4±1.3 
3.4±1.4 

5.2 ±1.4 
8.5 ±1.4 
1.5 ± 1.4 
4.9 ±1.4 



67835, 
474+83 

384!}^ 
668!« 

551ii 

344^ 

822*jj 

605+ 89 

618!™ 

14 

JJi -185 

640!« 

-78 
895 +64 

585_gg 

530 + « 
37 1 + ! 07 



588+ 



508+ 



-156 
S+71 
-82 
449+81 

572+ 73 

319^|7 
612$ 



0.270 
0.293 
0.218 
0.313 
0.157 
0.354 
0.452 
0.246 
0.180 
0.407 
0.257 
0.296 
0.252 
0.280 
0.271 
0.428 
0.336 
0.279 
0.239 
0.280 
0.210 
0.292 
0.436 
0.152 
0.254 



5 2+ 4 ' 2 
J - -3.0 

5 0+ 3 - 4 
7 9+2.6 
z - z -1.4 
7 9+4.2 
/,z -3.1 

0.8=1;! 

24.2«| 

1 2 +L8 
liZ -1.2 

16.2*? 

4 3+ 3 ' 2 ' 

^■ J -2.6 

5.6± 34 

3 1+ 3 ' 2 

5 2 +3 ' 4 
J -2.8 

4 0+ 3 - 8 
^■ u -2.6 

q "s+4.4 
? - -3.8 

4.9«j 

4 4+3.3 
^-2.4 

0+2.2 

S -1.4 

5 3+ 3 - 7 
1+6.7 



1. 



0.8^ 



13±4 

4±2 

1±1 

7±3 

2±1 

15 ±4 

21 ±5 

3±2 

5±2 

11 ±4 

3±2 

3±2 

6±3 

13 ±4 

9±3 

23 ±5 

14±4 

13 ±4 

5±2 

6±3 

5±2 

9±3 

14±4 

2±1 

6±3 



Note. — The SIS and NFW models were fit to the data at 200-750/!^kpc. The listed errors include the contribution from large-scale structure along the 
line-of- sight. The listed value for rum is derived from the best-fit NFW model. This radius is used to compute /V2500. the excess number of galaxies with a 
rest-frame 6-band luminosity —22 < Mb < —18.5 within f^oo- 



ies of the cluster richness consider only galaxies on the red- 
sequence, because of the higher contrast, which improves the 
signal-to-noise ratio of the measurement. Unfortunately we 
lack color information, and we compute the total excess of 
galaxies. The background count levels were determined us- 
ing the COSMOS pointings that were used to measure the 
size-dependence of CTE. 

Figure [6^ shows the resulting lensing mass as a function of 
the restframe X-ray luminosity in the 0.1-2.4 keV band. The 
luminosities and masses have been scaled to redshift zero as- 
suming s elf-similar evolution with respect to th e critical den- 
sity (e.g.. lKaiserj|1986HBrvan & Normanll 1998b . where 



E(z) = -^- = V^ m a+z) 3 +n A (ii) 

no 

for flat cosmologies. The solid points indicate the clus- 
ters from the sample studied here. The clusters from the 
160SD survey for which X-ray temperatures have been de- 
termined (see §3.1) are indicated by blue points. To extend 
the range in X-ray luminosity we also sh ow measureme nts for 
the massiv e clusters that w ere studied in Hoekstra (2007|). We 
note, that iHoekstral d2007l) used bolometric X-ray luminosi- 
ties from Hornet] (1200 ll) . whereas here we use the restframe 
luminosities in the 0.1-2.4 keV band (whi ch are a factor 
~ 4 sm aller). We use the mass estimates from Mahdavi et al. 
(2008) which used new photometric red shift distributions , 
which were based on much larger data sets (lllbert et al.l2006l) . 

The agreement in lensing masses is good in the regions 
where the two samples overlap. However, the scatter in the 



mass-luminosity relation is substantial (both for the clusters 
studied here as well as the more massive clusters). We exam- 
ined whether some of the scatter could be due to the uncer- 
tainty in the position of the cluster center, but find no differ- 
ence when comparing results for clusters with different lev- 
els of confidence in the identification of the BCG (see Qbcg 
in Table [T] It is worth noting, however, that we do not find 
any massive clusters (> 1O 14 M ) that are X-ray faint (i.e., 
Lx < 10 44 erg/s), implying that the dispersion in the M-Lx 
re lation is relatively well -behaved. 

Reiprich & Bohringer ( 2002) studied a sample of 63 X-ray 
bright clusters and derived masses under the assumption of 
hydrostatic equilibrium. This sample of clusters spans a sim- 
ilar range in Lx as our combined sample. We converted their 
measurements forMsoo to M2500 using the mass-concentration 
relation given by Eqn. [10] and show the results in Figure [6] 
(small grey points). The agreement with our findings is very 
g ood. A more quantitative comparison is presented in §3.3. 

lYee & Ellingson (2003) have shown that there is a good re- 
lation between the cluster richness and the mass (and vari- 
ous proxies) for massive clusters. Our study extends to lower 
masses and as is shown in Figures |6}3 and c A^soo correlates 
well with both the X-ray luminosity and lensing mas£3- Sim- 
ilar r esults have been obtained using SPSS cluster samples 
(e.g. jRvkoff et alJl2008bl:lJohnston et alj|2007h . We assumed 
that A^soo scales as the mass M2500, which is a reasonable as- 
sumption if the galaxies trace the density profile. This choice, 

10 To compute /V2500 for the sample of massive clusters we used the same 
selection criteria as discussed above. 
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Figure 6. panel a: Plot of M2500 as a function of X-ray luminosity. To account for the range in redshift of the clusters, the mass and luminosity have been 
rescaled using the corresponding value for E(z) under the assumption of self-similar evolution. The solid points are the clusters from the sample studied here. 
The solid blue points are the clu sters from the 16 0SD survey for which X-ray temperatures have been determined (see §3.1). The open points correspond to the 
more massive clusters studied in Hoekstra 1 2007). The open red points indicate the well-known strong lensing clusters A370 and CL0024+16. For comparison 
the small grey points show the X-ray derived masses from Reiprich & Bohringer 12002), converted to M2500; panel b: Plot of M2500 as a function of A^soOi me 
excess of galaxies with restframe —22 < Mg < — 18.5 within ^2500; panel c: Plot of N2500 as a function of X-ray luminosity. 



however, does not affect our conclusions. 

The results agree well in the regions where the two samples 
overlap. The correlation between A^soo and M2500 is tighter 
than that of A^soo and Lx- The former is less sensitive to 
the projections along the line-of-sight (either substructures or 
an overall elongation of the cluster), because both the galaxy 
counts and the lensing mass are derived from projected mea- 
surements. The X-ray results provide a different probe of the 
distribution of baryons, which is expected to lead to additional 
scatter. Furthermore, some of the scatter may be caused by 



unknown contributions by AGNs. The importance of AGN 
can be evaluated using a combination of deeper, high spatial 
resolution (< 5") X-ray and radio imaging. Such X-ray ob- 
servations would also provide estimates for the temperature of 
the X-ray gas (which is a better measure of the cluster mass). 
Unfortunately, such data exist for only five of the low-mass 
clusters. For these, X-ray temperatures have been derived, 
which are li sted Tab le [3] For the massive cl usters we use th e 
values from Horner ( 200 1 ) that were used in Hoekstra ( 2007 ) . 
We note that all clusters follow a tight L x - T x relation. 
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Table 3 

X-ray temperatures 



I0 1: 



name 


kT x [keV] 


ref. 


RXJ01 10.3 + 1938 


1 46 MA9 

iMO -OA5 


1 


RXJ0847. 1+3449 


3 62+ 58 


2 


RXJ1117.4 + 0743 


z, -j+0.42 
■'• J -0.36 


3 


RXJ1354.2-0221 


3.66;g« 


2 


RXJ1524.6 + 0957 


5.1±6.36 


4 



Note, — referenc e s: miBruch eta[U2010l) : (21 ILumb et aTH2004T) ; (3) 
Carra sco et aTH 20071: Vikhlinin et al. (2QQ2) 



3.1. Comparison with X-ray temperature 

Figure [7] shows the resulting plot of M2500 as a function of 
X-ray temperature. RXJ1 1 17.4+0743 and RXJ1524.6 + 0957 
lie on the tight relati on defined by the bulk of the c lusters. The 
measurements from Reiprich & Btihringer ( 2002) also follow 
this relation (light grey points). However, RXJ0847.1 +3449 
and RXJ 1354.2 -0221 appear to be more massive than might 
be expected based on Tx- They appear to lie o n a paral- 
lel rel ation, along with some of the clusters from iHoekstral 
( 120071) . The latter clusters are CL0024+16 and A370, which 
are well known strong lensing clusters (indicated in red in 
Figures [6] and [7). These clusters were observed because of 
their strong lensing properties an d included in the search for 
archival CFHT data carried out by Hoekstra (20Q2D. 

Interestingly, all four clusters are outliers on both the M2500- 
Lx and N2SO0 ~~ A* relations presented in Figure [6] but follow 
the mass-richness relation. This is consistent with the pres- 
ence of (sub)structures along the line-of-sight boosting both 
M2500 and N2500: the projection of two mass concentrations 
(along the line-of-sight) would increase the richness and the 
weak lensing mass approximately linearly. The inferred X-ray 
temperature on the other hand will be close to that of the more 
massive system, whereas the X-ray luminosity will be much 
lower than expected, because it is proportional to the square 
of the electron density. We note that both RXJ0847.1 +3449 
and RXJ 1354.2 -0221 show evidence of strong lensing. 

Interestingly, th e X-ray image of RXJ0847 . 1 + 3449 in 
iLumb et ail (120041) shows evidence for a nearby cluster can- 
didate. The case is less clear for RXJ 1354.2 -0221, but the 
X-ray image shows a complex morphology. Unfortunately we 
lack the dynamical data to confirm whether RXJ0847. 1 +3449 
and RXJ 1 354.2 - 0221 are projected system s. The cluster 
RXJ1 1 17.4 + 0743 was studied in detail by ICarrasco et all 
(120071) . who find that this cluster i s also a project i on of two 
structures. For the main component lCarrasco et al.l (120071) in- 
fer a galaxy velocity dispersion of 592 ± 82 km/s, whereas 
the second structure is less massive with <r v = 391 ± 85 km/s. 
Based on our lensing analysis we obtained a velocity disper- 
sion of 639^ km/s, in good agreement wi t h the dynamical 
results for the main cluster. Carrasco et al. (2007) also per- 
formed a weak lensing analysis based on ground based imag- 
ing data and obtained a velocity dispersion of a = 778 ± 89 
km/s (where we took the average of their results for the g' 
and r 1 band), implying a mass 50% larger than our estimate. 
We are not able identify an obvious cause for this difference, 
but note that PSF-related systematics have a larger impact on 
ground based results. 

3.2. The M25QQ — LX scaling relation 

In this section we examine the correlation between M2500 
and the X-ray luminosity, in particular the normalization and 
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kT x [keV] 



Figure 7. Plot of M2500 as a function of X-ray temperature. To account for 
the range in redshift of the clusters, the mass has been rescaled using the 
corresponding value for E(z) under the assumption of self-similar evolution. 
The solid blue points are the clusters from the sample studied here. The open 
points correspond to the more massive clusters studied in H07, whereas the 
small grey points show the results from Reiprich & Bohringer (2002). The 
well-known strong lenses A370 and CL0024+16 are indicated as red open 
points. 



the power law slope. We fit a power law model to the com- 
bined sample to maximize the leverage in X-ray luminosity 



E(z)M 2 500 =M x 



L x /E(z) 
2 x 10 44 erg/s 



(12) 



If we naively fit this model to the measurements we find 
that the value for \ 2 of the best fit is too high (x 2 =106 with 
43 degrees of freedom). This indicates that there is intrinsic 
scatter in the relation, which is also apparent from Figure [6] 
We need to account for the intrinsic scatter in the fitting pro- 
cedure, because ignoring it will generally bias the best fit pa- 
rameters. We fit the model to our measurements, which have 
errors that follow a normal distribution. The intrinsic scatter, 
however, can be described by a log-n ormal distribution (see 
e.g., Fig. 13 in Vikhlinin et al. 2009a). We will assume that 
the intrinsic scatter can be approximated with a normal distri- 
bution with a dispersion oq m ln(10)2log2 (we use the log 
with base 10). 

To fit the M2500 - Lx relation we follow a maximum like- 
lihood approach. For a model / with parameters a, the pre- 
dicted values are y, = f(xf,a). The uncertainties in x, and y, 
are given by cr A , and oy,-. If we assume a Gaussian intrinsic 
scatter oq -, in the coordinate, the likelihood L is given by 



1 



-exp 



[y,-/(x,;a)] 2 



'2wWi 

where w, accounts for the scatter: 



2w} 



w] = 



dx 



2^2 



(13) 



(14) 
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If we consider the logarithm of the likelihood it becomes clear 
why including the intrinsic scatter differs from standard least 
squares minimization: 

-21n£ = 2^1nw, + ^(^^) 2 + C, (15) 

where the second term corresponds to the usual \ 2 - F° r a 
linear relation with no intrinsic scatter, the first term is a con- 
stant for a given data set and the likelihood is maximimized 
by minimizing \ 2 - However, if intrinsic scatter is included as 
a free parameter, the first term acts as a penalty function, and 
cannot be ignored. 

The presence of intrinsic scatter also exacerbates the 
Malmquist bias for a flux limited sample, such as the 160SD 
survejo As a result the average flux of the observed sample 
of clusters is biased high compared to the mean of the parent 
population (in particular near the flux limit of the survey). To 
account for Malmqu ist bias we follow the pr ocedure outlined 
in Appendix A. 2 of Vikhlinin et al.1 (|2009a) and correct the 
X-ray luminosities before fitting the M2500 _ Lx relation. We 
find that the correction for Malmquist bias is relatively mod- 
est, increasing the normalisation Mx by ~ 5% and reducing 
the slope a by ~ 5<%El 

We find an intrinsic scatter of aiogL x |M = 0.23^oci4 (or a 
relative er ror of ~ 70%), in good ag reement with other 
studies. Reiprich & Bohringer (2002) list a somewhat 
larger s catter of ai ngf , v | y = 0.29 for the HIFLUGCS sample, 
whereas Vikhli nin et aUd2009ah and lMantz etaT1(l2010al) find 
ciogLx|M = 0.17 ± 0.02 and 18 ±0.02, r e specti vely. Our 
results also agree well with Sta nek et al.1 {2006) who find 
&\agM\L x = 0.19 ± 0.03, which is in good agreement with our 

value of 0.17^$. 

The likelihood contours for the power law slope a of the 
^2500 _ Lx relation and the mass M2500 of a cluster with a 
luminosity of Lx = 2 x 10 44 /!7oerg/s are shown in Figure [8] 
For the combined sample of clusters we find a best fit slope 
of a = 0.68 ±0.07 and a normalization M x = (1.2 ±0.12) x 
10 14 /!7()Mq. The inferred slope is consistent with the value of 
| expected for self-similar evolution (Kaiser 1986|). 

It is also interesting to compare the constraints from both 
sub-samples separately. The best fit slope for the clusters 
studied in lHoekstral (120071) is a — 0.55^qq!J with a normaliza- 
tion of M x = (1.5 ±0.2) x 10 14 /i^M Q (indicated by the pink 
contours The blue contours in Figure [8] indicate the con- 
straints for the HST sample studied here. The parameters are 

We assume that the clusters studied in Hoekstra 12007) do not suffer 
from Malmquist bias. 

12 We also computed the cluster mass function in a ACDM cosmology 
(using parameters from Evrard et al. 12002) and assigned X-ray luminosities 
using our best fit M2500 — Lx relation and intrinsic scatter and find similar 
biases. 

13 We note that the slope is different from the original Hoekstra (20Q3) re- 
sults, who found a = .43 ± 0. 1 . The reason for this large change is twofold. 
First, [Hoekstra ' 2007) did not account for intrinsic scatter in the fit of the 
scaling relation. Furthermore, the current analysis includes three clusters 
for which no X-ray data was available in Hoekstra (2007). The clusters in 
question are A209, A383 and MS1231+15. The latter two have the lowest 
X-ray luminosities and drive much of the change in slope, whereas includ- 
ing A209 does not change the previous results. Restricting the sample to the 
clusters that were used by Hoekstra (2007) to constrain the slope, we find 
a = 0.41 ±0.10, in agreement with the orginal result. Such a rather large 
variation in best-fit slope demonstrates the need for larger samples of clusters 
with multi-wavelength data. 




0.5 1 1.5 2 2.5 
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Figure 8. Likelihood contours for the slope of the M25oo — Lx relation and the 
mass of a cluster with a luminosity of Lx = 2 X lO 44 /;^ 2 , ergs/s. The contours 
indicate the 68.3%, 95.4% and 99.7% confidence limits on two parameters 
jointly. The side panels show the probability density distributions for each 
parameter (while marginalising over the other). The black curves correspond 
to the constraints determined from the combined sample of clusters, whereas 
the cyan and pink curves correspond to the low mass and high mass samples, 
respectively. 



not well constrained, with best fit values a = 0.63 ± 0.24 and 
a normalization M x = (1.0 ± 0.24 x 10 l4 /i^M Q . 

The differenc e in the constraints from the (extended) 
iHoekstral ((2007) sample and the 160SD systems may hint at 
a deviation from a single power law relation. As discussed 
in §2.5, however, the uncertainty in the position of the clus- 
ter center leads to an underestimate of the cluster mass , as 
does the presence of substructure. The Hoekstra] d2007l) re- 
sults are much less sensit ive to the s e prob lems. To quan- 
tify this, we combine the Hoekstra] (120071) results with the 
160SD clusters with Qbcc = 2 (12 systems) and the ones 
with Qbcg < 2 (13 systems). For the former we find Mx = 
(1.30±0.15)x 10 l4 /i^M© and a = 0.63 ± 0.08, whereas re- 
quiring Q B cg < 2 yields M x = (1.19 ±0.14) x 1O 14 /!^M and 
a = 0.68 ± 0.08. This comparison suggests that our normali- 
sation may be biased low by as much as ~ 10%. 

3.3. Comparison with X-ray samples 

The relation between the X-ray luminosity and cluster mass 
has been studied extensively. In this section we compare our 
measurements to a number of recent results, which are shown 
in Figure [9] Where needed, the X-ray luminosities are con- 
verted to the 0.1-2.4 keV band and Eqn.fTOlhas been used to 
convert masses to M2500 an d adjust the slopes (because the re- 
lation between M2500 an d ^200 (° r A^soo) is a power law with 
a slope less than 1.). 

Reip rich & Bohringer! ([2002) studied a sample of 63 clus- 
ters, with masses derived under the assumption of hydrostatic 
equilibrium. Their BCES bisector results for the flux-limited 
sample yields a = 0.60±0.05 andM v = (1.3±0.09 )x 10 14 Mp, 
are in dicated by the blue triangle in Fig. [9] Stan ek etaU 
(2006), however, have argued that these results suffer from 
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Malmquist bias. Instead they compared the X-ray number 
counts to the mass function in ACDM cosmologies and de- 
rived a = 0.54 ± 0.02 and a high normalization of Mx = (2. 1 ± 
0.1) x 10 14 M Q . This result, however depends strongly on the 
adopt ed value for erg , and combination with the WMAP3 data 
( Sperg el et al.120 07) lowers the normalization to M x = (1 .56± 
0.08) x 10 14 M Q (open pink triangle in Fig.©. IVikhlinin etaD 
(2009a) studied a sample of z ~ 0.05 and z ~ 0.5 clusters 
that were observed with Chandra. Their results with a = 
0.55 ±0.05 and M x = (1.43 ±0.08) x 10 14 M Q are indicated 
by the o pen orange square. Another recent study was pre- 
sented by Mantz et all (l2010aT) who found a fairly steep slope 
of a = 0.70 ±0.04 and M x = ( 1.42 ±0.28) x 10 14 M Q (purple 
square). 

The constraints from these studies are largely driven by X- 
ray luminous clusters (t his i s particularly true for the results of 
Vikhli ninetall(l2009al) and lMantz et ai1d2010al) ). If the slope 
of the M25oq-Lx varies with Lx, then it is more appropriate to 
co mpare these stud ies to the results from the sample studied 
in iHoekstral (l2007h . for which the agreement is indeed very 
good. When combined with the results from the 160SD sur- 
vey, the resulting scaling relation has a lower normalisation 
and steeper slope. To examine whether this could be caused 
by a change in slope, it is interesting to compare to measure- 
ments at lower luminosities. 

At the low X-ray luminosity end, Leauth aud et al.l (120101) 
studied X-ray groups using COSMOS. Because of the small 
area surveyed, the groups are less luminous than the sample 
of clusters from the 160SD survey studied here. Nonethe- 
less it is interesting to ex trapolate their results for compari- 
son. iLeauthaud et al.l (120101) use the mass-concentration rela- 
tion from lZhao et al.ld2009l) . We refit the 160SD sample using 
this relation and find that the average M2500 is unchanged, but 
that M200 is increased by 24%. We account for this whe n con- 
verting the measurements from Leauth aud et al.l (120101) . and 
find that they imply M x = (1.4 ±0.3) x 10 14 M Q and a slope 
a = 0.61 ±0.13 (indicated by the open green circle in Fig. [9j, 
in fair agreement with our results. We also note that this high- 
lights the difficulty in comparing results, especially when the 
analyses differ in detail. This is particularly relevant for the 
comparison with SDSS results in the following section. 

3.4. Comparison with SDSS 

Ryko ff et al.l (l2008al) measured the scaling relation between 
Lx and M200 f° r a large sample of optic ally selected cluster s 
found in the Sloan Digital Sky Survey (Ko ester et al.l l2007). 
This allowed them to extend the study of the M-Lx rela- 
tion to much lower luminosities, compared to most of the 
measurements presented in the previous section. The clus- 
ters were binned in richness and for each bin, the mean X-ray 
luminosity and weak lensing mass were determine d. In the 
lensing analysis, described in dJohnston et al.ll2007h . both the 
mass and concentration are fit as free parameters. The re- 
sulting values for c agree well with the relation presented in 
Duffy et al.l (120081) . Up on conversion to M?s no we find that 
the measurements from Rvkoffetal. (2008a) correspond to 
M x = (1.5±0.1)x 1O 14 /^AM and a = 0.55 ±0.04. This re- 
s ult is indicated in Figure |9] by the red point. 

Mandelb aum et al.l (2008b) have pointed out that t he weak 
lensing masses determined by iJohnston et al.l (120071) may be 
too low (by as much as ~ 24% ), because of a bias in t he source 
redshift distribution (also see ILeauth aud et al. 2010). The red 
arrow indicates the shift in normalization if the bias pointed 




M x [lOi%-JM ] 

Figure 9. Comparison of the slope of the M2500 - Lx relation and the mass 
of a cluster with a luminosity of Lx = 2 X 10 44 /j^ ergs/s. The contours 
are the same as in Figure [8] The blue triangle indic ates the resu lts from 
Reiprich & Bohringer 1 2002). The result from Stanek et al. 1 2006) is indi- 
cated by the open pink triangle. We also show constrain ts from two re- 
cent studiesofrnassiye clusters by [Vikhlinin et al. 1 2009a) (orange square) 
and Mantz et al. 1 2010a) ( purple square). The re sults from a study of X-ray 
groups in COSMOS by Leauthaud et al. 12010) are indicated by the green 
circle. Note that the accuracy of the comparison is limited, b ecause of dif- 
ference s in the mass-concentration relation. The results from Ry koff et alj 
1 2008a) based on SDSS data are indicated by the red point. The arrow in- 
dicates the possible shift in mass because of a bias in the source redshift 
distribution. 



out by Mandelbaum et al. (2008b) is correct. In that case the 
results from Ry koff et alj d2008al) disagree with all other mea- 
s urements. 

iRykoff et al.l (l2008al) find a shallower slope and higher 
mass, which appears to be inconsistent with our results. As 
discussed above, uncertainties in the adopted cluster centers 
may lead us to underestimate our normalisation by as much 
as ~ 10%, wh i ch is not sufficient to remove the difference. 
IJohnston et al.l (120071) model the centroiding uncertainty us- 
ing results from mock catalogs. However, if the model over- 
esti mates the offsets the resulti ng masses will be biased high 
and lMandelbaum et al.l ((2008a) argue that this may indeed be 
the case. 

Another possible e xplanation for the large normalisation of 
IRykoff et alj d2008a) is that their results are expected to be bi- 
ased towards a lower Lx at a given lensing mass. The reason 
for this was already alluded to in §3.1, namely that some clus- 
ters appear X-ray underluminous and have low X-ray temper- 
atures, given their high masses. These clusters, however do 
follow the tight mass-richness relation. Ry koff et alj {2008a) 
bin their clusters in richness and analyse stacked X-ray and 
lensing data. The fact that ^2500 and M2500 are strongly cor- 
related, then leads to a larger mean mass at a given X-ray 
luminosity. 

The presence of substructure in the cluster (as well as fil- 
aments), will boost both the lensing and richness estimates 
relative to the X-ray luminosity. Such structures are numer- 
ous at the low mass end: it is rare to find an alignment of 
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rich clusters, because they are rare, whereas an alignment of 
groups should be more frequent. Hence, the bias is likely to 
be mass dependent, increasing the masses of low X-ray lumi- 
nosity systems by a larger fraction, compared to X-ray lumi- 
nous clusters. The consequence is a flatter slope of the M-L x 
relation. 

Comparison with numerical simulations can help clarify 
the amplitudes of the various biases listed above. How- 
ever, the argument presented above highlights the danger of 
stacking samples of clusters when parameters are correlated. 
Identifying and quantifying such correlation requires well- 
characterized (both in selection and data coverage) and large 
samples of clusters. The 160SD clusters provide an excel- 
lent starting point, but increasing the number of cluster in this 
mass range with high quality X-ray data is of great importance 
to better constrain the scaling relations and to help interpret 
the results from stacked samples of clusters. 

4. CONCLUSIONS 

To extend the observed scaling relation between cluster 
mass and X-ray luminosity towards lower L x , we have deter- 
mined the masses of a sample of 25 clusters of galaxies drawn 
from the 160 square degree ROSAT survey ( Vikhlin in et alj 
119981) . The clusters have redshifts 0.3 < z < 0.6, and the X- 
ray luminosities range from 2 x 10 43 to 2 x 10 44 erg/s. The 
masses were determined based on a weak lensing analysis of 
images in the F814W filter obtained using the ACS on the 
HST 

To measure the mass we assume that the brightest cluster 
galaxy indicates the center of the cluster. In most cases this 
leads to an unambiguous identification, but in a number of 
cases the choice of center is less clear. Nonetheless, we have 
verified that our choice of center does not affect the results for 
the inferred scaling relations. 

To correctly interpret the weak lensing data, we derived an 
accurate emperical correction for the effects of CTE on the 
shapes of faint galaxies. We detect a significant lensing signal 
around most of the clusters. To increase the range in cluster 
prope rties, we extend the sample with massive clusters stud- 
ied by Hoekstra (2007). The lensing masses agree well where 
the two samples overlap in L x . 

The inferred lensing masses correlate well with the over- 
density of galaxies (i.e., cluster richness). The relation be- 
tween mass and X-ray luminosity has significant intrinsic 
scatter. Under the assumption it follows a log-normal dis- 
tribution we find a scatter of (Ji os l x \m = 0-23^^% which is 
in good agreement with other studies. We fit a power law 
relation between M2500 and L x and find a best fit slope of 
a = 0.68 ±0.07 and a normalisation (for L x = 2 x 10 44 h^ 
erg/s) of M x = (1.2 ±0.12) x /z^IO^Mq. Comparison with 
other studies is complicated by the fact that the conversion of 
the masses depends on the assumed mass-concentration rela- 
tion. We find that the resu lts for the sample of massive clus- 
ters from iHoekstral ([2007) agrees well with a number of re- 
cent studies. The combination with clusters from the 160SD 
survey lowers the normalisation, which could be caused by a 
steepening of th e M-Lx rela t ion. H owever, a study of low 
mass systems by Rykoff et al. (2008a) finds a higher normal- 
isation. 

Uncertainties in the position of the cluster center, as well as 
deviations from the adopted NFW profile (e.g., substructures) 
may bias our masses low, but we estimate this t o be less than 
10%, w hich cannot explain the difference with iRvkoff et al.l 
(2008a). On the other hand, structures along the line-of-sight, 



which also may simply reflect the fact that clusters themselves 
are highly e longated, will lead to a higher normalisation for 
the study by Rvkoff et al.l (l2008al) . They binned clusters dis- 
covered in the SDSS by richness and measured their ensem- 
ble averaged X-ray luminosity and lensing mass. In this case 
the tight correlation between lensing mass and richness re- 
sults in a low average L x at a given mass. Furthermore, the 
relative importance of substructures and projections is mass 
dependent, preferentially affecting low mass systems. Conse- 
quently, both the slope and the normalization of the M-L x 
relation are affected. 

To investigate the importance of the various biases, larger 
samples of low mass clusters need to be observed. Better X- 
ray observations provide a good starting point to extend the 
mass range over which scaling relations are determined and 
to improve the interpretation of ensemble averaged samples 
of clusters. 
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APPENDIX 
MEASUREMENT OF CTE EFFECTS 

In this appendix we present the resul ts of our study of the effect of CTE on shape measurements. Similar to other studies (e.g., 
Mas sey et alj2007t iRhodes et alj[2007h we derive an empirical correction, although our actual implementation differs from these 
previous works in a number of ways. 

The CTE problem occurs during readout, and therefore the correction should be applied b efore the correction for PSF 
anisotropy. Ideally one would like to correct the images before shape measurements are done dMassey et al.ll2010l) . but for 
our purposes such a sophisticated approach is less important. Instead we will quantify the change in e\ (this is the polarization 
component that quantifies the elongation along the x— and y- direction). We note that our measurements are done on images after 
th ey have been c orrect ed for camera distortion. 

iMassev et alj d2007l) measured the effect of CTE by determining the mean galaxy ellipticity as a function of distance from 
the readout electronics. We cannot do so for a number of reasons. First, we have data from a much smaller number of images. 
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Figure 10. Panel a: CTE induced polarization at the center of the exposure (maximum distance from the read-out electronics), as a function of time. The black 
points correspond to stars with a signal-to-noise ratio of ~ 4000, whereas the red points show the trend for S/N ~ 40. The results are consistent with a linear 
increase with time. Panel b: maximum CTE induced polarization as a function of S/N ratio at the mean Modified Julian Date of our cluster observations. The 
red curve is the best fit model oc y/S/N. Note that this scaling is different from Rhodes et al] 120071) . 



More importantly, the presence of the lensing signal induced by the clusters also gives rise to a variation in e\. it is larger in the 
centre of the field and decreases towards the edge. Instead we examine observations of dense star fields. After correction for PSF 
anisotropy correction, the shape of each star provides an accurate estimate of the CTE effect, because it is intrinsically round, 
unlike a galaxy. As a result, we can measure the amplitude of CTE much more accurately. 

We use observations of the star cluster NGC104, which has been observed at regular epochs for a program to study the stability 
of ACS photometry. We select only the data with exposure times of 30s, which yields a uniform data set of 17 exposures, of 
which we omit one. These single exposures are drizzled and we measure the shape parameters from these images. The PSF 
anisotropy model for each exposure is determined using stars with F814W < 18, as the effects of CTE are expected to be small 
for such bright objects. The fainter stars are corrected for PSF anisotropy and we measure e\ as a function of y coordinate. 

We assume that the pattern is the same for each ACS chip and combine the data. We note that this assumption is actually 
supported by our measurements. For each exposure we fit a linear trend with distance from the read-out electronics 

e CTE = e max ( 3,/2048), 

where e™ x is the the maximum induced polarization. Figure [Tol a) shows the resulting value for <?'™ x as a function of time for 
stars with high (black points) and low (red points) signal-to-noise ratios. These measurements are consistent with a linear increase 
with time of the CTE induced distortion. 

As expected the CTE effects are also more pronounced for fainter objects. To investigate this trend further, we compute e 1 ™" as 
a function of signal-to-noise ratio, adopting the average Modified Julian Date of our cluster observations. The results are shown 
in Figure [lOt b). We fit the following model to our measurements 

^ = ^^(^)(MH>-52,333). (Al) 

The red line in Figure [TUfb) corresponds to the best model, for which we find e™" = (-9.07 ±0.16) x 10~ 5 . Note that we 
have assumed that the CTE effect is proportional to y/S/N, which is different from lRhodes et alJ (12007b who argue for a scaling 
oc (S/N) . The latter scaling, ho wever, is inconsistent with our findings. We have confirmed our results with other stellar fields. 
We note that Rhodes et al.] (12007b use a different shape measurement method, which may explain some of the differences. 

We now have an accurate model for the effects of CTE for stellar images, but it is not clear whether it is adequate for galaxies, 
which are more extended. Our cluster data cannot be used for this test, because the lensing signal induced by the clusters leads to 
a comparable change in ey with y position. Instead we retrieved 100 pointings taken as part of the COSMOS survey (PID:10092) 
and analysed these data in a similar fashion as our own. 

Figure QT| shows the measurement of e™ ax as a function of galaxy size r g for a signal-to-noise ratio of 20 and a mean modified 
Julian date of 53195. The points have been corrected for the small increase in mean S/N ratio as r g increases. The left-most 
point is the result derived from NGC104. We detect a clear size dependence of the CTE signal. We assume the dependence is a 
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Figure 11. Maximum CTE induced polarization as a function of galaxy size r g (for a S/N=20 and MJD=53195) based on the analysis of 100 pointings of the 
COSMOS survey. The left most point corresponds to the results from our analysis of star fields. We detect a clear dependence on size. The best fit power-law 
model is indicated by the solid line and has a slope ~ —2. 

power-law, and we find a best fit slope of -1 .85 ± 0.3; the best fit model is indicated by the solid line in Figure[TT] We therefore 
revise our PSF-based model to account for the size dependence of the CTE effect: 



where the best fit value for e™ ax = (-8.3 ±0.14) x 10 5 . Thismodelis used to correct the observed e\ for stars and galaxies in our 
data. 
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